We introduce a general framework for visual forecasting, which directly imitates visual sequences without additional supervision. As a result, our model can be applied at several semantic levels and does not require any domain knowledge or handcrafted features. We achieve this by formulating visual forecasting as an inverse reinforcement learning (IRL) problem and directly imitating the dynamics in natural sequences from their raw pixel values. The key challenge is the high-dimensional and continuous state-action space, which prohibits the application of previous IRL algorithms. We address this computational bottleneck by extending recent progress in model-free imitation with trainable deep feature representations, which (1) bypasses the exhaustive state-action pair visits of dynamic programming through a dual formulation and (2) avoids explicit state sampling during gradient computation through a deep feature reparametrization. This allows us to apply IRL at scale and directly imitate the dynamics in high-dimensional, continuous visual sequences from raw pixel values. We evaluate our approach at three levels of abstraction, from low-level pixels to higher-level semantics: future frame generation, action anticipation, and visual story forecasting. At all three levels, our approach outperforms existing methods.
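For context, a representative dual formulation from model-free imitation learning (generative adversarial imitation learning, Ho & Ermon, 2016) poses imitation as a saddle-point problem between a policy $\pi$ and a discriminator $D$; the following is a sketch of that general objective, not a restatement of this paper's exact formulation:

$$
\min_{\pi} \max_{D} \; \mathbb{E}_{(s,a)\sim\pi}\big[\log D(s,a)\big] \;+\; \mathbb{E}_{(s,a)\sim\pi_E}\big[\log\big(1 - D(s,a)\big)\big] \;-\; \lambda H(\pi),
$$

where $\pi_E$ denotes the expert (demonstration) policy and $H(\pi)$ is a causal entropy regularizer. Because this objective is estimated from sampled trajectories rather than by dynamic programming over an enumerated state-action space, it remains tractable when states are raw pixels.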